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Abstract — Cloud computing has recently drawn significant at- 
tention of both academia and industry as an emerging computing 
paradigm where data, applications, or processing power are 
provided as services through the Internet connection. Cloud 
computing extends the existing computing infrastructure owned 
by the cloud service providers (CSPs) to achieve the economies of 
scale through virtualization and aggregated computing resources. 
End users, on the other hand, can reach these services through 
an elastic utility computing environment with minimal upfront 
investment. Nevertheless, pervasive use of cloud computing and 
the resulting rise in the number of data centers have brought 
forth the concerns about energy consumption and carbon emis- 
sion. Therefore, this paper addresses the problem of resource 
provisioning and task scheduling on a cloud platform under the 
service level agreement, in order to minimize the electric bills 
and maximize the profitability for the CSP. User task graphs and 
dependencies are randomly generated, whereas user requests for 
CPU and memory resources are extracted from Google cluster 
trace. A general type of dynamic pricing scenario is assumed 
where the energy price is both time-of-use and total power 
consumption-dependent. A negotiation-based iterative approach 
has been proposed for the resource provisioning and task schedul- 
ing that is inspired by a Field-Programmable Gate Array (FPGA) 
routing algorithm. More specifically, in each iteration, decisions 
made in the previous iteration are ripped-up and re-decided, and 
the concept of congestion is introduced to dynamically adjust the 
resource provisioning decisions and the schedule of each task 
based on the historical results as well as the current status. 
Experimental results demonstrate that the proposed algorithm 
achieves up to 63.52% improvement in the total energy price 
compared to baseline. 

I. Introduction 

Cloud computing is an emerging computing paradigm in 
which hosted software, platforms and infrastructure are de- 
livered as services through pooled resources, geographic di- 
versity and universal connectivity [1]. Virtualization, which 
is recognized as the fundamental technology that powers 
cloud computing, enables cloud service providers (CSPs) to 
make more efficient use of hardware resources such as data 
centers and server clusters through facilitating a greater degree 
of abstraction of the software environment [1], [2]. With 
virtualization technology, end users, who are incentivized by 
the savings of reducing or eliminating the costs associated 
with hardware, software or licensing fees, can rent a variety of 
virtual machines (VMs) from CSP based on the requirements 
of their own applications (jobs/tasks). On the other hand, CSP 
achieves economies of scale and makes profits by charging 



a time-based or usage-based fee from a large number of 
end users. There are many cloud computing platforms in 
the market such as Google App Engine (GAE) [3], Amazon 
Elastic Compute Cloud (EC2) [4], and Microsoft Windows 
Azure Platform [5]. 

Service-Level Agreement (SLA) is a service contract ne- 
gotiated and established among CSP and end users, which 
documents privacy, security, and the agreed-upon levels of 
availability, serviceability, compensation in the event of a con- 
tract breach, and performance metrics like deadline (delivery 
time of service). CSP should consistently meet the agreements 
written in the SLA and meanwhile aim to increase market 
share, growth and profitability through reducing operational 
expenses. It is widely known that the bulk of the CSP operating 
cost comes from the electric bills of data centers where large 
groups of disks, servers, and routers are networked. In the 
U.S., data centers consumed an estimated 91 billion kWhs 
of electricity in 2013, equivalent to the annual output of 
34 large (500 megawatt) coal-fired power plants [6]. As a 
further matter, data center electricity consumption is projected 
to increase to roughly 140 billion kWhs annually by 2020, 
costing companies 13 billion U.S. dollars annually in electric 
bills and emitting nearly 100 million metric tons of carbon 
pollution per year [6]. This necessitates the development of 
efficient management techniques for CSPs to reduce the energy 
consumption in data centers. 

According to [7], most of today’s servers have two im- 
portant energy usage features: (i) servers tend to be signif- 
icantly more inefficient under low levels of utilization than 
being exercised at the optimal utilization rate (around 70- 
80% depending on the servers), and (ii) servers may have 
significant power consumption during idle periods. Therefore, 
server consolidation and load balancing can be applied to 
achieve a high performance-to-power ratio through selectively 
shutting down idle servers and improving the utilization levels 
in active servers. Of note, with various advances in server 
energy conservation technologies, high-end modern servers are 
able to cut down the active idle power to less than 20% of 
their peak power consumption under peak load condition [8]. 
Nevertheless, such servers are extremely expensive and the 
majority of servers in data centers are previous models or latest 
low/medium-end models, which are still suffering from the 
energy inefficiency caused by server idle power consumption. 





Fig. 1: An example benchmark result for SPECpower_ssj2008: 
(a) typical low/medium-end model: Hewlett-Packard Company 
Proliant DL580 G5 and (b) typical modern high-end model: 
Dell Inc. PowerEdge M830 

In this sense, server consolidation and load balancing are 
more attractive solutions than huge investments in high-end 
servers. Figure 1 (a) and (b) provide the energy usage profiles 
for a typical low/medium-end server and a latest high-end 
server model, respectively. The high-end model is much more 
efficient than the low/medium-end model during idle or low 
utilization periods, whereas the price of the high-end model is 
about 28X of the low/medium-end model. 

Considerable research efforts have been conducted in the 
context of improving energy efficiency in cloud data centers 
using server consolidation and load balancing. The authors 
in [9] propose a hybrid particle swarm optimization with an 
improved ant colony algorithm to achieve a good trade-off 
between the rate of convergence and the scale of global search 
space, however, workloads and cloud platform are assumed 
to be homogeneous. To consider heterogeneous characteristics 
in both workloads and servers, the authors in [10] propose a 
heterogeneity-aware resource management system for dynamic 
capacity provisioning in cloud computing environments. In 
order to achieve the overall efficiency with the awareness of 
other competing users and the entire cloud resource map, the 
authors in [11] develop a genetic algorithm which deals with 
resource provisioning, VM placement, and task scheduling in 
a holistic fashion. However, the genetic algorithm in [11] have 
scalability problem. 

In this paper, we consider the resource provisioning and 
task scheduling problem for a CSP that owns a number 
of heterogeneous server farms, in order to minimize the 
CSP’s electric bills under dynamic pricing. A general type 
of dynamic pricing policy is considered in this paper, which 
is comprised of a time-of-use (TOU) price and a power 
consumption-dependent price. Users request VMs using the 
Pay-As-You-Go option (which is available in most cloud 
computing platforms such as Amazon EC2 [4] and Microsoft 
Azure [5]), and the workloads of users are modelled as a 
collection of multiple job graphs with internal dependencies. 
The server farms comprise a diverse mix of computing and 
data resources, and the entire cloud platform is modelled as 
a weighted graph where communication among servers incurs 
extra delay. In the Pay-As-You-Go agreement, users do not 
need to schedule tasks or architect data storage. Instead, CSP 



determines the scheduling as well as the allocation of physical 
resources to host the VMs such that the overall efficiency 
is maximized while stipulated requirements in SLA are met. 
Figure 2 provides the overview of the cloud computing envi- 
ronment. The workloads sent by users will first go through an 
admission control policy which filters the jobs that are unable 
to meet deadlines even with the best available computing 
resources allocated. Next, CSP, which is assumed to have 
abundant VM resources to accommodate users’ requests, will 
configure VMs for the remaining tasks and schedule the tasks 
using a resource provisioning and task scheduling algorithm. 

In order to solve the aforementioned problem efficiently, we 
develop a novel negotiation-based cloud resource provisioning 
and task scheduling algorithm for CSP to fully leverage the 
flexible, scalable infrastructure and cost savings. The proposed 
algorithm adopts an iterative manner and is inspired by the 
negotiation-based FPGA routing method in [12], and this 
heuristic method has also been applied to other task scheduling 
problems [13], [14]. In order to balance the load through 
the entire scheduling period, the concept of congestion is 
introduced, which guides the scheduler to shift loads from 
high utilization time periods to low utilization periods. In each 
iteration, the proposed algorithm rips-up all the decisions that 
are made in the previous iteration, re-decides VM configu- 
ration for each task and re- schedules all tasks based on the 
degree of congestion. Congestion related terms are trained 
in each iteration and guide the scheduler to achieve overall 
energy efficiency gradually. Job graphs and task dependencies 
are randomly generated and real-world user workload requests 
for CPU and memory resources provided in Google cluster 
trace are used to conduct the experiments [15]. Experimental 
results demonstrate that the proposed algorithm can achieve 
up to 63.52% energy price reduction compared to baseline. 

II. System Model 

A. Time Model, Price Model and Workload Model 

In this paper, we adopt a slotted time model where all system 
parameters, constraints and scheduling decisions are provided 
for discrete time intervals of constant and equal length, which 
is denoted by T unit . The proposed algorithm can achieve near- 
continuous results with fine-grained time slots. 





End users send N job requests to the CSP, where each job 
a (1 < a < N) consists of a total number of n a tasks with 
dependencies. Job a is represented by a directed acyclic graph 
(DAG) G a (V a ,E a ), where each vertex T- 1 (1 < i < n a ) 
embodies a task. Of note, each task can be uniquely indexed 
by Tf, and the total number of tasks in all jobs is denoted 
by ritotai = =i n a • Edge from task Tf to task Tj* indicates 

that Tj depends on the output of Tf and the amount of data 
that needs to be fed into T ( - J from Tf is represented by the 
weight of the edge Wfj . The entire workload request can 
be represented by a set of N DAGs. Figure 3 illustrates an 
example of user workload. End users also specify a deadline 
L deadline f° r eac h job a, and we use L%(g) to denote the worst 
case execution time of task Tf with g type VM allocated. The 
analysis of worst case execution time affects the results but 
falls outside the scope of this paper. For each task Tf, the 
scheduled start time is denoted by sf. The entire scheduling 
period is denoted by L max , which equals the last finish time 
among all the tasks. A combined price model c(t,P to tai (*)) 
is considered in this paper, which is comprised of a TOU 
price determined by the time slot t and a power consumption- 
dependent price that depends on the total instantaneous power 
consumption Ptotai{l ) in the time slot t. The TOU price 
is higher in peak usage time slots than off-peak time slots, 
which incentivizes energy users to shift loads towards off-peak 
periods. The power consumption-dependent price is assumed 
to be monotonically increasing with respect to Ptotalif ), which 
penalizes peak energy consumption that can potentially affect 
grid stability and lead to blackouts, power cuts, and brownouts. 

The cloud platform supports K categories of VMs where 
tasks are executed. Each type V M g is associated with a two- 
tuple parameter set {R 9 ME m^ F 9 cpu }, which represents the 
amount of memory and CPU resources required to establish 
a V M g on a server. Each task Tf is also associated with a 
two-tuple parameter set Fcpu}’ which indicates the 

minimum amount of memory and CPU resources required to 
run task Tf. Therefore, task T- 1 can only be executed in those 
types of VMs with abundant resources (i.e., F^em — Fmem 
and F^pjj < R 9 CPU ), and we use an integer set 0% to represent 
all the types of VMs that support task T?. Accordingly, 
end users can only request the VM type g that support the 
requested task Tf, i.e., V Tf, requested VM type g G Of. 

B. Cloud Platform Model 

The cloud infrastructure of the CSP is comprised of several 
server clusters located in different data center campuses, and 





the entire cloud platform is modelled as an undirected graph 
Gcsp(D,B). There are M vertices in the vertex set D , 
which denote the M servers (Di, £> 2 , ..., Dm) in the cloud 
infrastructure. Each edge B x y embodies the link between 
server D x and D y , and the communication bandwidth of 
the link B X:V is represented by its weight b X:V . By default, 
data exchange in the same server does not incur extra delay, 
i.e., B x x = oo, whereas communication among different 
servers results in transmission overhead. Figure 4 provides an 
example of cloud platform, which consists of communication 
links and two server clusters, each with six servers. As 
mentioned in section I, we consider heterogeneous server 
farms in this paper. Each server D x is associated with a two- 
tuple parameter set {C%[ EM , C^ PU }, which represents the 
total amount of memory and CPU resources available on the 
server D x . Besides, each server D x can only support several 
VM types and an integer set ( x stores all the types of VMs 
that server D x can support. Each server is also associated with 
a K x L max VM configuration matrix Q x , where element 
Q x (g,t) at the g - th row and t - th column indicates the number 
of g type VMs allocated on the server D x in the t - th time 
slot. Obviously, the following constraints must be satisfied all 
the time: (i) Wg £ ( x and Vt e [1 ,L max ], Q x (g,t ) = 0, 
and (ii) Vt and \/x, Ti^ =1 Q x (g,t) • R 9 CPU < C^ PU and 
Q x {gP) • R 9 mem — Gmem- 

C. Energy Consumption Model and Problem Formulation 

The energy consumption in each server D x depends on the 
utilization rate U x (t ), which is the ratio of the CPU usage in 
time slot t over the total CPU resources available on server 
D x . Hence, the utilization rate U x (t ) is calculated as 

Q x (g , t) • Rrnrr 

u x (t ) = g ~ lV ^ x ioo% (i) 

° CPU 

During operation, each server D x has static power consump- 
tion p static^) and dynamic power consumption P;i ynaT , nc (t), 
and the total power consumption Pf ota i (t) is calculated as 

Ptotal (t) = Pstatic (t) + Pdynamic W (2) 

F static^) cons t an t when U x (t ) > 0, 0 otherwise. 

F dynamic^), on the other hand, is related to the optimal 
utilization level of server D x , which is denoted by Opt x . 
The Opt x of most modern servers is about 70-80%. Dynamic 
power consumption increases drastically when utilization level 
is greater than the optimal utilization rate. We adopt the 




Fig. 3: User workload model 



Fig. 4: Example cloud platform 



dynamic power consumption model developed in [11] and 
Pdynamicit) is calculated as 

a x -U x (t), if U x (t)<Opt x 

a x ■ Opt x + • (U x (t) - Opt x ) , otherwise 

(3) 

where a x and f3 x are server dependent parameters. Using the 
abovementioned models, the resource provisioning and task 
scheduling problem is formulated as follows: 

Resource Provisioning and Task Scheduling (RPTS) Prob- 
lem for CSP. 

Given price model, workload and cloud platform information. 
Find the VM configuration and start time for each task. 
Minimize the total energy price during the entire scheduling 
period: 

Lmax ,M \ 

Total Cost = £“.£ P 'total (*)) ) (4) 

t = 1 ' x=l V ' 

Subject to: 

i) ■ R 9 CPU < C X CPU , Vi and Va (5) 

Xf =1 Q x (g, t ) ■ R g MEM < C X MEM , Vi and Va (6) 

*? + L?(9) < Leadline, VI? (?) 

Pmem — R 9 mem^t: ( 8 ) 

Fcpu - R 9 cpu^T? ( 9 ) 

Q x {g,t ) = 0,W i Cx and Vi e [l, Lmax] (10) 

III. Negotiation-Based Resource Provisioning and 
Task Scheduling Algorithm 

The RPTS problem described in Section II-C is an integer 
non-linear programming problem subject to integer non-linear 
constraints. No provably optimal solutions can be derived 
in polynomial time, nor are they likely to ever be available 
for this kind of problems [16]. Therefore, in this paper, we 
invest an efficient heuristic to search for a suitable solution 
instead of the optimal one. The remainder of this section is 
organized as follows. Subsection III- A explains the motivation 
to introduce the negotiation-based routing method to the RPTS 
problem. Subsection III-B presents the proposed negotiation- 
based resource provisioning and task scheduling algorithm. 

A. Motivation to Introduce Negotiation-Based Routing 
Method to the Resource Provisioning and Task Scheduling 
Problem 

The FPGA routing process aims to connect signals properly 
according to the gate-level logic descriptions such that the 
worst case delay is minimized and the route is feasible, 
i.e., no signals share any physical wire or switch. In order 
to solve the FPGA routing problem efficiently, the authors 
in [17] proposed the Negotiated Congestion (NC) algorithm, 
where the concept of congestion is introduced to represent 
the degree of routing resource sharing. More specifically, in 
the NC algorithm, several congestion terms are integrated 



into the objective function (i.e., path delay) and the router 
is guided by these terms to avoid congested wires or switches 
while aiming to minimize the path delay. There are certain 
similarities between the FPGA routing problem and the RPTS 
problem: 

• Sharing routing resources in FPGA among signals leads 
to infeasible routing solution, and utilizing the same time 
slots among tasks on the same server result in high 
utilization and low energy efficiency. 

• In both problems, finding the optimal solution is im- 
practical due to the extremely long runtime. Hence, it 
is reasonable to seek for a suitable solution instead of 
the optimal one in both problems. 

• Considering the large scale of search space in both 
problems, one pass is not sufficient to generate a good 
enough solution. Therefore, iterative approach is more 
suitable for both problems. 

Motivated by the above similarities, we introduce the con- 
cept of congestion to the proposed algorithm to effectively 
solve the RPTS problem. 

B. Negotiation-Based Resource Provisioning and Task 
Scheduling Algorithm 

In this subsection, a Negotiation-Based Resource 
Provisioning and Task Scheduling (NBRPTS) algorithm 
is proposed to effectively solve the RPTS problem described 
in Section II-C. 

As mentioned in Section I, an iterative approach is adopted. 
The proposed algorithm finds a good enough solution when 
the total cost in Eqn. (4) has not decreased for d te rm itera- 
tions or the iteration number exceeds the maximum iteration 
number denoted by . The results of the algorithm are the 
configuration and scheduling of each task. The configuration 
determines the VM type and physical server, and the schedul- 
ing is the start time to execute the task. We define a ready task 
as a task with all its dependencies satisfied. In each iteration, 
all tasks T? (1 < a < N, 1 < i < n a ) will be ripped-up, 
i.e., decisions made in the previous iteration will be cleaned, 
and the VM configuration as well as the scheduling for each 
ready task will be re-decided one by one until all tasks are 
re-configured and re-scheduled. We denote the configuration 
and scheduling decision in the d- th iteration after scheduling 
the /-th task (1 < / < n to tai ) by Sj. Accordingly, static 
and dynamic power consumption during the configuration and 
scheduling process are functions of the configuration and 
schedule decision, which are represented by Sj) 

and Pd ynCL mi c (ti Sf), respectively. Similarly, the total power 
consumption of server D x becomes Pfi otal (t, Sj). We define 
the Cost Increase (Cl) after configuring and scheduling the 
/-th task in the d- th iteration as follows 

= E f 6 0 ’ E ( Ptotal (t,S d f )))~ 

t=a V ' x=i * 

/ M x \ 

/*’E( P ^(iAUi))j J (11) 




Clearly, for each ready task, the Sj with the minimum CI 
will be chosen. 

One important observation is that it is more likely to see 
a high utilization rate and high energy price in the time slots 
which have already been occupied by other tasks. Such time 
slots are considered as congested , and it is reasonable to avoid 
congested time slots in the rest of the iteration in order to 
minimize the total energy price. An intra-iteration congestion 
term R(t ) is introduced to quantify the degree of congestion, 
which is equal to the total number of tasks that have been 
scheduled to occupy time slot t within an iteration. Another 
important observation is that tasks configured earlier may 
have the same configuration decisions (i.e., occupy the same 
energy efficient VMs and servers) in all iterations. This may 
block other configurations scenarios where tasks configured 
later take the energy efficient VMs as well as servers and the 
overall energy cost is lower. Therefore, we introduce an inter- 
iteration congestion term hf(x, g) to indicate the total number 
of times in the previous iterations when task Tf is allocated to 
server D x with VM g . The term h%(x,g) gradually increases 
if task Tf is allocated to the same server and VM in several 
iterations, and eventually task Tf will be guided to explore 
other configuration possibilities. 

In addition to the abovementioned congestion terms, we 
introduce another inter-iteration term V a (t) to represent the 
total number of times when job a violates the deadline 
requirement in Eqn. (7) at the t-th time slot in the previous 
iterations. The time slots that are occupied by job a and cause 
deadline violation in the previous iteration will see higher 
energy costs as the term V a (t) gradually increases. Hence, 
the term V a (t) penalizes deadline violation and incentivizes 
tasks of job a to utilize time slots that do not violate the 
deadline requirement. With the aforementioned terms, in the 
d- th iteration, the Cl after configuring and scheduling the /- th 
task Tf is modified as: 

Lmax / / -AT \ 

CI '= E - 

t= 1 \ k x=t ' 

/ M x \ 

^t,J2{ P t X otal(t,SU))U • (l + Cl 

/ K M \ Lmax 

( 1 +C 2 -EE^( a: ’ 5 )) + E C 3- V a(t) ( 12 ) 

' g = 1 x=l ' t = 1 

where ci, C 2 , and C 3 are positive weights of R(t), h^(x,g) 9 
and V a (t ), respectively. After introducing the above terms, the 
NBRPTS algorithm will avoid congested time slots within 
an iteration due to the term c\ • R(t), and explore other 
configuration possibilities since the term hf(x, g ) permanently 
increases when task is configured onto server D x using the 
g type VM after several iterations. If tasks in job a occupy time 
slots which violate the deadline requirement, the term V a (t) 
will increase the energy costs and after several iterations, tasks 
in job a will give up these time slots due to the high energy 



costs caused by the term V a (t) and try to complete execution 
in early time slots. 

Initially, all the terms R(t), hf(x,g), and V a (t) are zeros, 
and the CV is the same as C/. Intra-iteration term R(t) is 
reset at the beginning of each iteration and updated after each 
decision Sj is made. Inter-iteration terms h%{x,g) and V a (t), 
on the other hand, are reset only during initiation and updated 
at the end of each iteration. The computational complexity 
of the proposed algorithm is 0{d rnax L rnax n^ otal MK). The 
n< totai term comes f rom finding each ready task and for each 
ready task, conducting configuration as well as scheduling. Al- 
gorithm 1 provides the pseudo code of the proposed algorithm. 



Algorithm 1: Negotiation-Based Resource Provisioning 
and Task Scheduling Algorithm 



1 Initialize task graphs, price model, cloud platform model and 

terms; 

2 Set iteration counter d = 0; 

3 repeat 

4 d — d -f- 1; 

5 foreach time slot t do R(t) = 0; 

6 foreach task do 

7 | rip up all configuration and scheduling decisions; 

8 end 

9 repeat 

10 Find a ready task Tf; 

11 Configure and schedule the task Tf such that CI' in 
Eqn. (12) is minimized; 

Update the intra-iteration term R(t); 
until all tasks are configured and scheduled ; 
foreach job a do 

foreach time slot t do 

Update inter-iteration term V a (t) ; 
foreach server D x do 
foreach VM type g do 
| Update inter-iteration term /i“(x,g); 
end 

end 



end 



end 



Calculate total energy price in Eqn. (4); 

25 until total energy cost has not decreased for d te rm iterations 

or d> dm ax ; 

26 return configuration and scheduling of each task; 



IV. Experimental Results 

In this section, we demonstrate the effectiveness of the 
proposed algorithm on a heterogeneous cloud platform, which 
consists of up to 4 server farms and 30 heterogeneous servers. 
Communication among servers in the same server farm is 
much faster than communication among servers in different 
server farms. We randomly generate user task graphs with 
different numbers of tasks in each job, and the dependencies 
among tasks in a job are represented using a DAG. In order 
to consider the dynamics of the jobs and resource requests 
arriving at the cluster, we extract the vector of resource request 
per task in terms of CPU and memory from the Google 
cluster dataset released in 2011 [15], which is measured on 
a heterogeneous server cluster during a 29-day period. The 




CPU and memory resource profiles of VMs are generated such 
that each VM can support only a subset of tasks, whereas 
the resources available on each server are provided so that 
each server can accommodate a limited number of VMs 
simultaneously. Table I summarizes the key parameters of the 
cloud platform and workloads. 

TABLE I: Modeling parameters summary 



Cloud Platform Parameters 


User Workload Characteristics 


Servers 

Farms 


Servers 


Total 

Users 


Task per 
User 


Task 

Latency 


2-4 


15-30 


5-20 


5-14 


2-7 



We assume the utility company provides the TOU- 
dependent price as well as the instantaneous power 
consumption-dependent price, and the power consumption- 
dependent price part is monotonically increasing with the 
CSP’s real-time total power consumption. We compare the 
proposed algorithm with a greedy baseline that schedules each 
ready task to start at the best possible time slot based on the 
cost increase function in Eqn. (11). 

We first compare the total energy cost through the entire 
29-day period (i.e., different resource request) using a fixed 
job number N = 10 and a fixed cloud platform with 4 
server farms and 30 servers. Results in Figure 5 demonstrate 
that the proposed NBRPTS algorithm consistently outperforms 
the baseline during the entire 29-day period. Next, using the 
resource request information on the first day, we sweep the 
number of jobs from 5 to 20 on the same cloud platform and 
sweep the server number from 10 to 30 using the same job 
number, in order to show the effects of workloads and server 
resources, respectively. Due to space limitation, we only show 
the normalized energy cost results in Figure 6. Of note, less 
energy cost in larger number of jobs, as shown in Figure 6 (a), 
is caused by the randomness of task and platform generation. 
The proposed NBRPTS algorithm achieves up to 63.52% 
energy cost savings compared to baseline with different job 
numbers and server numbers. Besides, the proposed algorithm 
results in deadline violations only in a few scenarios in 
the previous experiments, whereas the baseline has deadline 
violations in most cases (because the baseline algorithm does 
not consider deadline). The worst case runtime of the proposed 
algorithm is 152.62s in the previous experiments. 

V. Conclusion 

The overarching goal of this paper was to minimize the total 
energy cost for CSP under dynamic pricing, while meeting the 
deadline requirements in the SLA. An iterative negotiation- 




Fig. 5: Impact of task resource request 





Fig. 6: (a) Impact of job number and (b) impact of server 
resources 

based resource provisioning and task scheduling algorithm was 
proposed, where the concept of congestion was introduced, in 
order to guide the scheduler to avoid congested time periods 
and escape local optimum traps. The experimental results 
demonstrated that the proposed algorithm could achieve up 
to 63.52% improvement compared to the baseline. 
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